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Title : A marke r specific for Escherichia coli serotypes Q157:H7; Q157:NML 

^and Q55:H7 
FIELD OF THE INVENTION 
5 This invention relates to a novel DNA marker specific for E. 

coli serotypes 0157:H7, 0157;NM and 055:H7 and the use of the marker in 
developing assays to detect these serotypes of E. coli in a sample. 
BACKGROUND OF THE INVENTION 

£. coli 0157:H7 is a food-borne human pathogen causing a 

10 spectrum of diseases including diarrhea, hemorrhagic colitis (HC) and 
hemolytic uremic syndrome (HUS). Development of nucleic-based assays 
for rapid detection of E. coli 0157:H7 has been challenging due to a lack of 
specific DNA markers for this organism. The reported DNA markers for 
E. coli 0157:H7 have one or more of the following limitations or 

15 drawbacks: being non-specific, so that multiplex PCR assays need to be 
used; and containing only one base pair mutation, limiting their use in 
assay development. Consequently, there is a need in the art to provide 
novel markers for E. coli 0157:H7 and related serotypes. 
SUMMARY OF THE INVENTION 

20 The present inventors have prepared a novel marker specific 

for E. coli serotypes 0157:H7, 0157:NM and 055:H7. The DNA sequence of 
the marker has a total of 1583 nucleotides and has no significant homology 
to any known DNA sequences. Accordingly, in one aspect, the present 
invention provides an isolated nucleic acid sequence comprising the 

25 sequence shown in Figure 1 (SEQ.ID.NO.:l) or a diagnostic fragment 
thereof. 

The marker of the invention can be used to develop probes or 
primers that can be used to detect E. coli serotypes 0157:H7; 0157:NM and 
055:H7 in a sample. In particular, the marker has been demonstrated to be 
30 useful in designing primers for PCR assays for specific detection of the E. 
coli serotypes. As the specific DNA sequences of this invention are not 
homologous to previously known sequences, various specific PCR assays 
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can be developed with only one primer pair. In addition, other nucleic 
acid-based assays such as DNA chip or biosensor assays can also be 
developed without the restriction in using a very limited region of a 
marker. Therefore, the invention can be used to develop nucleic acid 
5 based assays to detect the E. coli serotypes 0157:H7, 0157;NM and 055:H7. 

Other features and advantages of the present invention will 
become apparent from the following detailed description. It should be 
understood, however, that the detailed description and the specific 
examples while indicating preferred embodiments of the invention are 

10 given by way of illustration only, since various changes and modifications 
within the spirit and scope of the invention will become apparent to those 
skilled in the art from this detailed description. 
BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the 

15 drawings in which: 

Figure 1 and SEQJD.NO.:! shows the nucleotide sequence of 
the 1583 bp marker of the invention* 

Figure 2 and SEQ JD.NO»2 shows the nucleotide sequence of a 
360 bp fragment of the marker* 

20 Figure 3 is a gel electrophoresis of PGR products amplified 

using the PCR assay from DNA preparations of representative E. coli 
serotypes: lanes 1, 2, 0157:H7; lanes 3, 4, 0157.NM; lanes 5 - 9, 0145:NM; 
lanes 10, 11, 055:H7; lane 12, 026:H11; lane M, 100 bp DNA ladder. 
DETAILED DESCRIPTION OF THE INVENTION 

25 L MARKER 

As hereinbefore mentioned, the present inventors have 
prepared a novel DNA marker specific for E, coli serotypes 0157:H7, 
0157:NM and OS5:H7. 

The marker was identified and isolated using a relatively new 
30 marker technology, fluorescent amplified fragment length polymorphism 
(FAFLP) and DNA sequencing. The DNA sequence of the marker has a 
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total of 1583 nucleotides and has no significant homology to any known 
DNA sequences. 

Accordingly, in one aspect, the present invention provides an 
isolated nucleic acid molecule comprising the sequence shown in Figure 1 
5 (SEQ.IDJNFO.:l), wherein T can also be U, or a diagnostic fragment thereof. 
The term " isolated" refers to a nucleic acid molecule 
substantially free of cellular material or culture medium when produced 
by recombinant DNA techniques or chemical precursors when chemically 
synthesized. The term "nucleic acid" includes deoxyribonucleic acid 

10 (DNA) and ribonucleic acid (RNA) and can either be double stranded or 
single stranded. 

The term "diagnostic fragment" means any fragment of the 
marker shown in Figure 1 (SEQJD.NO.rl) that is useful in a diagnostic 
assay to detect E. coli serotypes 0157:H7; 0157:NM and 055M7. The 

15 diagnostic fragment includes fragments that can be used as primers in PCR 
assays and fragments that can be used as probes in detection assays. The 
diagnostic fragments of the invention will not cross react with bacteria 
other than E. coli serotypes 0157:H7; 0157:NM and 055:H7. 

The invention also includes (1) a nucleic sequence that is 

20 complimentary to the sequence shown in Figure 1 (SEQJD.NO.:l) or a 
diagnostic fragment thereof; (2) a nucleic sequence that can hybridize to the 
sequence shown in Figure 1 (SEQJD.NO.:l) or a diagnostic fragment 
thereof; (3) a nucleic acid sequence which has substantial sequence 
homology to (1) or (2); and (4) a nucleic acid sequence which is an analog of 

25 the nucleic acid sequences of (1) to (3). Such sequences are also useful in 
developing diagnostic assays to detect E. coli serotypes 0157:H7; 0157:NM 
and 055:H7. 

The term "a nucleic acid sequence which has substantial 
sequence homology" means a nucleic acid sequence which has slight or 
30 inconsequential sequence variations from the sequence shown in Figure 1 
(SEQJD.NCX:1). For example, one skilled in the art can appreciate that 
variations can be made to the sequence of Figure 1 (SEQJD.NO.:!) and still 
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permit the use of the sequence in detecting the E. coli serotypes of the 
invention. In addition, natural variations may exist in the sequence of 
certain isolates which may be attributable to local mutations or structural 
modifications. Nucleic acid sequences having substantial homology 
5 include nucleic acid sequences having at least 85%, preferably 90-95% 
identity with the nucleic acid sequence as shown in Figure 1 
(SEQmNCXrl). 

The term "a nucleic acid sequence that can hybridize" means 
a nucleic acid sequence that can hybridize to a nucleic acid sequence shown 

10 in Figure 1 (SEQJD.NCXrl) or a diagnostic fragments thereof under 
hybridization conditions, preferably stringent hybridization conditions. 
Appropriate stringency conditions which promote DNA hybridization are 
known to those skilled in the art, or may be found in Current Protocols in 
Molecular Biology, John Wiley & Sons, N.Y. (1989), 63.1-63.6. For 

15 example, the following may be employed: 6.0 x sodium chloride /sodium 
citrate (SSC) at about 45°C, followed by a wash of 2.0 x SSC at 50 6 C. The 
stringency may be selected based on the conditions used in the wash step. 
For example, the salt concentration in the wash step can be selected from a 
high stringency of about 0.2 x SSC at 50°C. In addition, the temperature in 

20 the wash step can be at high stringency conditions, at about 65°C. 

The term "a nucleic acid sequence which is an analog" means 
a nucleic acid sequence which has been modified as compared to the 
sequence shown in Figure 1 (SEQ.ID.NO.:l) or a diagnostic fragment 
thereof wherein the modification does not alter the utility of the sequence 

25 (i.e. to detect E. Coli serotypes 0157:H7; 0157:NM and 055:H7 or to develop 
probes, primers or microchips to detect the serotypes) as described herein. 
The modified sequence or analog may have improved properties over the 
sequence shown in Figure 1 (SEQJD.NO.:l) or a diagnostic fragment 
thereof. One example of a modification to prepare an analog is to replace 

30 one of the naturally occurring basis (i.e. adenine, guanine, cytosine or 
thymidine) of the sequence shown in Figure 1 (SEQ.ID.NO.:l) with a 
modified base such as such as xanthine, hypoxanthine, 2-aminoadenine, 6- 
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methyl, 2-propyl and other alkyl adenines, 5-halo uracil, 5-halo cytosine, 6- 
aza uracil, 6-aza cytosine and 6-aza thymine, pseudo uracil, 4-thiouracil, 8- 
halo adenine, 8-aminoadenine, 8-thiol adenine, 8-thiolalkyl adenines, 8- 
hydroxyl adenine and other 8-substituted adenines, 8-halo guanines, 8- 
5 amino guanine, 8-thiol guanine, 8-thiolalkyl guanines, 8-hydroxyl 
guanine and other 8-substituted guanines, other aza and deaza uracils, 
thymidines, cytosines, adenines, or guanines, 5-trifluoromethyl uracil and 
5-trifluoro cytosine. 



10 phosphorous or oxygen heteroatoms in the phosphate backbone, short 
chain alkyl or cycloalkyl intersugar linkages or short chain heteroatomic or 
heterocyclic intersugar linkages in the nucleic acid molecule shown in 
Figure 1 (SEQ.ID.NO.:l). For example, the nucleic acid sequences may 
contain phosphorothioates, phosphotriesters, methyl phosphonates, and 

15 phosphorodithioates. 



the invention is a peptide nucleic acid (PNA) wherein the deoxyribose (or 
ribose) phosphate backbone in the DNA (or RNA), is replaced with a 
polyamide backbone which is similar to that found in peptides (P.E. 

20 Nielsen, et al Science 1991, 254, 1497). PNA analogs have been shown to be 
resistant to degradation by enzymes and to have extended lives in vivo 
and in vitro. PNAs also bind stronger to a complimentary DNA sequence 
due to the lack of charge repulsion between the PNA strand and the DNA 
strand. Other nucleic acid analogs may contain nucleotides containing 

25 polymer backbones, cyclic backbones, or acyclic backbones. For example, 
the nucleotides may have morpholino backbone structures (U.S. Pat. No. 
5,034,506). The analogs may also contain groups such as reporter groups, a 
group for improving the pharmacokinetic or pharmacodynamic properties 
of nucleic acid sequence. 

30 Other analogs include isolated nucleic acid sequences having 

sequences which differ from the nucleic acid sequence shown in Figure 1 
(SEQ.ID.NO.:!) due to degeneracy in the genetic code are also within the 



Another example of a modification is to include modified 



A further example of an analog of a nucleic acid molecule of 
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scope of the invention. Such nucleic acids encode functionally equivalent 
proteins but differ in sequence from the above mentioned sequences due 
to degeneracy in the genetic code. 

The present invention further includes the preparation or 
5 isolation of other nucleic acid sequences which are the same, analogous, 
homologous or can hybridize to the nucleic acid sequences of the 
invention. For example, an isolated nucleic acid molecule of the 
invention which comprises DNA can be isolated by preparing a labelled 
nucleic acid probe based on all or part of the nucleic acid sequences as 

10 shown in Figure 1 (SEQ.ID.NO.:l), and using this labelled nucleic acid 
probe to screen an appropriate DNA library (e.g. a cDNA or genomic DNA 
library). For example, a genomic library can be used to isolate a DNA by 
screening the library with the labelled probe using standard techniques. 
Nucleic acids isolated by screening of a cDNA or genomic DNA library can 

15 be sequenced by standard techniques. 

An isolated nucleic acid molecule of the invention which is 
DNA can also be isolated by selectively amplifying a nucleic acid using the 
polymerase chain reaction (PCR) methods and cDNA or genomic DNA. It 
is possible to design synthetic oligonucleotide primers from the nucleic 

20 acid sequence as shown in Figure 1 (SEQ.ID.NO.:l), for use in PCR. A 
nucleic acid can be amplified from cDNA or genomic DNA using these 
oligonucleotide primers and standard PCR amplification techniques* The 
nucleic acid so amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. It will be appreciated that cDNA 

25 may be prepared from mRNA, by isolating total cellular mRNA by a 
variety of techniques, for example, by using the guanidinium-thiocyanate 
extraction procedure of Chirgwin et aL, Biochemistry, 18, 5294-5299 (1979). 
cDNA is then synthesized from the mRNA using reverse transcriptase (for 
example, Moloney MLV reverse transcriptase available from Gibco/BRL, 

30 Bethesda, MD, or AMV reverse transcriptase available from Seikagaku 
America, Inc., St. Petersburg, FL). 



WO 00/77247 



• 

PCT/CA0O/0O716 



-7- 

An isolated nucleic acid molecule of the invention which is 
RNA can be isolated by cloning a cDNA of the invention into an 
appropriate vector which allows for transcription of the cDNA to produce 
an RNA molecule. For example, a cDNA can be cloned downstream of a 
5 bacteriophage promoter, (e.g., a T7 promoter) in a vector, cDNA can be 
transcribed in vitro with T7 polymerase, and the resultant RNA can be 
isolated by standard techniques. 

A nucleic acid molecule of the invention may also be 
chemically synthesized using standard techniques. Various methods of 

10 chemically synthesizing polydeoxynucleotides are known, including solid- 
phase synthesis which, like peptide synthesis, has been fully automated in 
commercially available DNA synthesizers (See e.g., Itakura et al. U.S. 
Patent No. 4,598,049; Caruthers et aL U.S. Patent No. 4,458,066; and Itakura 
U.S. Patent Nos. 4,401,796 and 4,373,071). 

15 The sequence of a nucleic acid molecule of the invention may 

be inverted relative to its normal presentation for transcription to produce 
an antisense nucleic acid molecule. Preferably, an antisense sequence is 
constructed by inverting a region preceding the initiation codon or an 
unconserved region. In particular, the nucleic acid sequences contained in 

20 the nucleic acid molecules of the invention or a fragment thereof, 
preferably a nucleic acid sequence shown in Figure 1 (SEQ.ID.NO.:l) may 
be inverted relative to its normal presentation for transcription to produce 
antisense nucleic acid molecules. 

The antisense nucleic acid molecules of the invention or a 

25 fragment thereof, may be chemically synthesized using naturally occurring 
nucleotides or variously modified nucleotides designed to increase the 
biological stability of the molecules or to increase the physical stability of 
the duplex formed with mRNA or the native gene e.g. phosphorothioate 
derivatives and acridine substituted nucleotides. The antisense sequences 

30 may be produced biologically using an expression vector introduced into 
cells in the form of a recombinant plasmid, phagemid or attenuated virus 
in which antisense sequences are produced under the control of a high 
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efficiency regulatory region, the activity of which may be determined by 
the cell type into which the vector is introduced* 
II. USES OF THE MARKER 

As previously mentioned, the isolation of the novel marker 
5 for E. coli serotypes 0157:H7; 0157:NM and OS5:H7 allows the 
development of diagnostic assays that can be used to detect the serotypes in 
a sample. The sample can be any sample, including but not limited to, 
clinical, food, water and environmental samples. Clinical samples include 
bodily materials such as blood, urine, serum, tears, saliva, feces, tissues and 
10 the like. 

Accordingly, the present invention provides a method of 
detecting the presence of E. coli serotypes 0157:H7; 0157:NM or 055:H7 in 
a sample comprising (a) isolating nucleic acid from the sample and (b) 
determining if the isolated nucleic acid contains (1) a nucleic sequence 

15 shown in Figure 1 (SEQ.ID.NO.rl) wherein T can also be U or a diagnostic 
fragment thereof; (2) a nucleic sequence that is complimentary to the 
1 sequence shown in Figure 1 (SEQ.ID.NO.rl) or a diagnostic fragment 
thereof; (3) a nucleic sequence that can hybridize to the sequence shown in 
Figure 1 (SEQ.ID.NCX:1) or a diagnostic fragment thereof; (4) a nucleic acid 

20 sequence which has substantial sequence homology to (1), (2) or (3); or (5) a 
nucleic acid sequence which is an analog of any of the nucleic acid 
sequences of (1) to (4), wherein the presence of a nucleic acid sequence 
defined in (1) to (5) indicates the presence of E. coli serotypes 0157:H7; 
0157:NM or 055:H7 in the sample. 

25 (a) Primers 

The present invention includes the preparation of nucleic 
acid primers based on the sequence of the marker shown in Figure 1 
(SEQ.ID.NO.rl). Accordingly, the present invention provides an isolated 
nucleic acid primer comprising a portion of a (1) a nucleic sequence shown 
30 in Figure 1 (SEQ.ID.NCX:1) wherein T can also be U or a diagnostic 
fragment thereof; (2) a nucleic sequence that is complimentary to the 
sequence shown in Figure 1 (SEQJD.NO.:!) or a diagnostic fragment 
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thereof; (3) a nucleic sequence that can hybridize to the sequence shown in 
Figure 1 (SEQ.ID*NO.:l) or a diagnostic fragment thereof; (4) a nucleic acid 
sequence which has substantial sequence homology to (1), (2) or (3); or (5) a 
nucleic acid sequence which is an analog of any of the nucleic acid 
5 sequences of (1) to (4). Preferably, the primer contains from about 5 to 
about 50 nucleotides, more preferably from about 15 to 30 nucleotides. 

The length and bases of primers are selected so that they will 
hybridize to different strands of the sequence as shown in Figure 1 
(SEQ.ID.NO.rl) and at relative positions along the sequence such that an 

10 extension product synthesized from one primer when it is separated from 
its template can serve as a template for extension of the other primer into 
a nucleic acid of defined length* Primers which may be used in the 
invention are oligonucleotides, i.e., molecules containing two or more 
deoxyribonucleotides of the nucleic acid molecule of the invention which 

15 occur naturally as in a purified restriction endonuclease digest or are 
produced synthetically using techniques known in the art such as for 
example phosphotriester and phosphodiester methods (See Good et ah 
NucL Acid Res 4:2157, 1977) or automated techniques (See for example, 
Conolly, B.A. Nucleic Acids Res. 15:15(7): 3131, 1987). The primers are 

20 capable of acting as a point of initiation of synthesis when placed under 
conditions which permit the synthesis of a primer extension product 
which is complementary to a DNA sequence of the invention, i.e., in the 
presence of nucleotide substrates, an agent for polymerization such as 
DNA polymerase and at suitable temperature and pH. Preferably, the 

25 primers are sequences that do not form secondary structures by base 
pairing with other copies of the primer or sequences that form a hair pin 
configuration. 

It will be appreciated that the primers may contain 
non-complementary sequences provided that a sufficient amount of the 
30 primer contains a sequence which is complementary to a nucleic acid 
molecule of the invention or oligonucleotide fragment thereof, which is 
to be amplified. Restriction site linkers may also be incorporated into the 
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primers allowing for digestion of the amplified products with the 
appropriate restriction enzymes facilitating cloning and sequencing of the 
amplified product. 



5 618 of the sequence shown in Figure 1 (SEQ.ID.NO.:l). In another 
embodiment, the primer comprises the reverse complement of 
nucleotides 1136-1155 of the sequence shown in Figure 1 (SEQ.ID.NCX:!). 
In a specific embodiment, the primer is 0157-F and has the sequence 5- 
CGGTTTAATGGCTTGTTGTGCTX3' (SEQ.ID.NO.:3). This primer 

10 corresponds to the sequence at nucleotides 597-618 of SEQ.ID.NO.:l except 
that the G residue at position 617 has been replaced with C in the primer. 
In another specific embodiment, the primer is 0157-R and has the 
sequence 5'-ATGCCATTAAACCGGTGGC-3' (SEQ.ID.NO.:4). This primer 
corresponds to the reverse complement of the nucleotides found at 

15 positions 1136-1155 of SEQ.ID.NO.il. These are shown in Table 3. 



inventors have demonstrated that these primers are specific for the E. coli 
serotypes 0157:H7; 0157:NM and 055:H7 but are not specific for 119 other 
E. coli strains belonging to 60 serotypes and 59 isolates belonging to 44 non- 
20 E. coli species (see Example 1). 



detecting the presence of E. coli serotypes 0157:H7; 0157:NM and 055:H7 
in a sample comprising (a) isolating nucleic acid from the sample; 
(b) amplifying the isolated nucleic acid with a primer comprising a 

25 sequence that is complimentary to a portion of (1) a nucleic sequence 
shown in Figure 1 (SEQJD.NO.:l) wherein T can also be U or a diagnostic 
fragment thereof; (2) a nucleic sequence that is complimentary to the 
sequence shown in Figure 1 (SEQTD.NO.tl) or a diagnostic fragment 
thereof; (3) a nucleic sequence that can hybridize to the sequence shown in 

30 Figure 1 (SEQ.ID.NO.:l) or a diagnostic fragment thereof; (4) a nucleic acid 
sequence which has substantial sequence homology to (1), (2) or (3); or (5) a 
nucleic acid sequence which is an analog of any of the nucleic acid 



In one embodiment, the primer comprises nucleotides 597- 



Using the primers illustrated in Table 3 in a PCR assay, the 



Accordingly, the present invention provides a method of 
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sequences of (1) to (4); and (c) assaying for amplified sequences, wherein 
the presence of an amplified sequence indicates that the sample contains 
one or more of E. coli serotypes 0157:H7; 0157:NM and 055:H7. Preferably, 
a primer pair is used, one with a sequence shown in SEQ*ID.NO.:3 and the 
5 other with a sequence shown in SEQ.ID.NO.:4. 

The nucleic acid sequences may be amplified in step (b) using 
any method that results in the amplification of nucleic acid molecules 
such as the Polymerase Chain Reaction, Ligase Chain Reaction or NASBA 
may be used to amplify a nucleic acid molecule of the invention (Barney 

10 in "PCR Methods and Applications" , August 1991, Vol.l(l), page 5, and 
European Published Application No. 0320308, published June 14, 1989, and 
U.S. Serial No. 5,130,238 to Malek). 

Preferably, the nucleic acid sequences are amplified in step (b) 
using a Polymerase Chain Reaction (PCR). The conditions which may be 

15 employed in the methods of the invention using PCR are those which 
permit hybridization and amplification reactions to proceed in the 
presence of DNA in a sample and appropriate complementary 
hybridization primers. Conditions suitable for the polymerase chain 
reaction are generally known in the art. For example, see M.A. Innis and 

20 D.H. Gelfand, PCR Protocols, A guide to Methods and Applications M.A. 
Innis, D.H. Gelfand, J J. Sninsky and TJ. White eds, pp3-12, Academic Press 
1989, which is incorporated herein by reference. Preferably, the PCR 
utilizes polymerase obtained from the thermophilic bacterium Thermus 
aquatics (Taq polymerase, GeneAmp Kit, Perkin Elmer Cetus) or other 

25 thermostable polymerase may be used to amplify DNA template strands. 

The amplified products can be isolated and distinguished 
based on their respective sizes using techniques known in the art. For 
example, after amplification, a DNA sample can be separated on an agarose 
gel and visualized, after staining with ethidium bromide, under ultra 

30 violet (uv) light. DNA may be amplified to a desired level and a further 
extension reaction may be performed to incorporate nucleotide derivatives 
having detectable markers such as radioactive labelled or biotin labelled 
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nucleoside triphosphates. The primers may also be labelled with 
detectable markers as discussed below. The detectable markers may be 
analyzed by restriction and electrophoretic separation or other techniques 
known in the art. 

5 The primers may be labelled with detectable markers which 

allow for detection of the amplified products. Suitable detectable markers 
are radioactive markers such as P-32, S-35, 1-125, and H-3; luminescent 
markers such as chemiluminescent markers, preferably luminol; and 
fluorescent markers, preferably dansyl chloride, fluorcein-5-isothiocyanate, 
10 and 4~fluor-7-nitrobenz-2-axa-l,3 diazole, Cy3, Cy5, Texas Red, NED; 
enzyme markers such as horseradish peroxidase, alkaline phosphatase, 
(3-galactosidase, acetylcholinesterase, or biotin. 
(b) Probes 

The present invention also includes the preparation of 
15 nucleic acid probes based on the sequence of the marker shown in Figure 1 
(SEQ.ID.NCX:1). Accordingly, the present invention provides an isolated 
nucleotide probe comprising a portion of (1) a nucleic sequence shown in 
Figure 1 (SEQ.ID.NO.rl) wherein T can also be U or a diagnostic fragment 
thereof; (2) a nucleic sequence that is complimentary to the sequence 
20 shown in Figure 1 (SEQ.ID.NO.:l) or a diagnostic fragment thereof; (3) a 
nucleic sequence that can hybridize to the sequence shown in Figure 1 
(SEQ.ID.NO.rl) or a diagnostic fragment thereof; (4) a nucleic acid sequence 
which has substantial sequence homology to (1), (2) or (3); or (5) a nucleic 
acid sequence which is an analog of any of the nucleic acid sequences of (1) 
25 to (4). 

Preferably, the probe comprises a portion of the sequence 
shown in Figure 1 (SEQ.ID.NO.:l). More preferably, the probe comprises 
from about 20 to about 500 nucleotides, even more preferably 50 to 200 
nucleotides from the sequence shown in Figure 1 (SEQ.ID.NO.rl). 
30 In one embodiment, the probe comprises nucleotides 597-677 

of the sequence shown in Figure 1 (SEQ.ID.NO.:l). Preferably, the probe is 
0157-A and has the sequence CGGTTTAATGGCTTGTTGTGGTAA 
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C ACCG A AGCC AG CTCA AT A A ATTGCT GCGATGAG TT AC 
AGCTATCGAGTAAACCACC (SEQJD.NCX:5). 

In another embodiment, the probe comprises nucleotides 
1059-1155 of Figure 1 (SEQJD.NO.:l). Preferably, the probe is 0157-B and 
5 has the sequence TATCCCGAATCGCCTGGCGTTTTTGCACATC 
CTCTGCGACGCTATTTTTGTGGAACGCAAAGCCATCAAGGAAAATA 
GCCA CCGGTTTAAT GGCAT (SEQ.ID.NO.:6). 

The probes can be used to detect the presence or absence of E. 
coli serotypes 0157:H7; 0157:NM and 055:H7 in a sample. 

10 Accordingly, the present invention also relates to a method of 

detecting the presence of a nucleic acid molecule associated with E. coli 
serotypes 0157:H7; 0157:NM and 055:H7 in a sample comprising 
(a) contacting the sample under hybridization conditions with one or 
more nucleotide probes which hybridize to the nucleic acid molecules and 

15 (b) determining if there is hybridization between the nucleic acid 
molecules in the sample and the nucleotide probes wherein the presence 
of hybridization indicates that the sample contains one of E. coli serotypes 
0157:H7; 0157:NM and 055:H7. 

Hybridization conditions which may be used in the methods 

20 of the invention are known in the art and are described for example above 
in the definition of "nucleic acid sequences that hybridize". The 
hybridization product may be assayed using techniques known in the art. 
The nucleotide probe may be labelled with a detectable marker and the 
hybridization product may be assayed by detecting the detectable marker or 

25 a detectable change produced by the detectable marker. 

The detectable marker used to label the probe can be any 
marker such as a radioactive label which provides for an adequate signal 
and has sufficient half life such as 32 P, 3 H, 14 C or the like. Other detectable 
markers which may be used include antigens that are recognized by a 

30 specific labelled antibody, fluorescent compounds, enzymes, antibodies 
specific for a labelled antigen, and chemiluminescent compounds. An 
appropriate label may be selected having regard to the rate of hybridization 
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and binding of the probe to the nucleotide to be detected and the amount 
of nucleotide available for hybridization, 
(c) Microchips 

The present invention also includes microchips comprising 
5 (a) an isolated nucleic acid molecule comprising (1) a nucleic sequence 
shown in Figure 1 (SEQJD.NO.:!) wherein T can also be U or a diagnostic 
fragment thereof; (2) a nucleic sequence that is complimentary to the 
sequence shown in Figure 1 (SEQ.ID.NO.:!) or a diagnostic fragment 
thereof; (3) a nucleic sequence that can hybridize to the sequence shown in 
10 Figure 1 (SEQJD.NO.:l) or a diagnostic fragment thereof; (4) a nucleic acid 
sequence which has substantial sequence homology to (1), (2) or (3); or (5) a 
nucleic acid sequence which is an analog of any of the nucleic acid 
sequences of (1) to (4) attached to (b) a microchip. 

In one embodiment, the microchip comprises a probe having 
15 the sequence CGGTTTAATGGCTTGTTGTGGTAA 
CACCG A A GCCAG CTCA AT A A ATTGCT GCGATGAGTT AC 
AGCTATCGAGTAAACCACC (SEQ.ID.NO.:5) attached to a microchip. 

In another embodiment, the microchip comprises a probe 
having the sequence TATCCCGAATCGCCTGGCGTTTTTGCACATC 
20 CTCTGCGACGCTATTTTTGTGGAACGCAAAGCCATCAAGGAAAATA 
GCCA CCGGTTTAAT GGCAT (SEQ.ID.NO.:6) attached to a microchip. 

As described in Example 2, microchips containing the probes 
shown in SEQJD.NO.:5 and SEQ.ID.NO.:6 were useful in detecting E. Coli 
serotype 0157:H7. 
25 (d) Kits 

Reagents suitable for conducting the above described 
diagnostic methods of the invention may be packaged into convenient kits 
providing the necessary materials, packaged into suitable containers. Such 
kits may include all the reagents required to detect an E. coli serotype in a 
30 sample by means of the methods described herein, such as appropriate 
probes and /or primers for performing PCR. 
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In one embodiment of the invention the kit contains a 
nucleotide probe which hybridizes with a nucleic acid molecule of the 
invention, reagents required for hybridization of the nucleotide probe 
with the nucleic acid molecule, and directions for its use. In another 
5 embodiment of the invention, the kit includes primers which are capable 
of amplifying a nucleic acid molecule of the invention or a predetermined 
oligonucleotide fragment thereof, all the reagents required to produce the 
amplified nucleic acid molecule or predetermined fragment thereof in the 
polymerase chain reaction, and means for assaying the amplified 
10 sequences. 

As mentioned above, the methods and kits of the present 
invention have many practical applications. For example, the methods 
and kits of the present invention may be used to detect E. Coli serotypes 
0157:117, 0157;NM or 055:H7 in any sample suspected of containing E. coli. 
15 Samples which may be tested include bodily materials such as blood, 
urine, serum, tears, saliva, feces, tissues and the like. Further, water and 
food samples and other environmental samples and industrial wastes may 
be tested. 

Before testing a sample in accordance with the methods 
20 described herein, the sample may be concentrated using techniques known 
in the art, such as centrifugation and filtration. For hybridization and /or 
PCR-based methods described herein, nucleic acids may be extracted from 
cell extracts of the test sample using techniques known in the art such as 
precipitation, solvent extraction and column purification. 
25 The following non-limiting examples are illustrative of the 

present invention: 

EXAMPLES 

Example 1 

AFLP marker identification 
30 The marker was initially identified as a 360 bp fluorescent 

fragment (Figure 2 or SEQ.ID.NO.:2) using the FAFLP method (Vos et al., 
1995. Nucleic Acids Research, Vol. 23, pp. 4407-4414). The FAFLP analysis 
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was performed by the selective amplification of restriction fragments from 
a digest of total genomic DNA with restriction enzymes EcoR I and Mse I. 
The DNA sequences of the selective primer pair that produced the 
polymorphic marker are as shown in Table 1 and in SEQ.ID.NOS:7 and 8. 
5 Among 163 E. coli strains tested that belonged to 40 serotypes (Table 2), 
only serotypes 0157:117, 0157:NM, 0145:NM and 055:H7 contained the 360 
bp fragment. 
DNA sequencing 

The target fragment was directly isolated from the AFLP gel 

10 using ABI 377 automated DNA Sequencer and re-amplified by polymerase 
chain reaction (PCR), and then sequenced. The DNA sequence of the 360 
bp fragment is shown in Figure 2 (SEQ.ID.NO.:2). A 1223 bp downstream 
region continued from the AFLP fragment was further sequenced using a 
single primer walking method with strains of E. coli serotypes 0157:117, 

15 0157:NM, 055:H7 and 0145:NM. Four mutations (C-T, A-G, T-A and G-A) 
in the region were identified in the strain of serotype 0145:NM. The 
complete sequence of the 1583 bp fragment is shown in Figure 1 
(SEQJD.NO.:l). The result of BLAST search showed that the DNA 
sequences of the 1583 bp fragment had no significant homology to 400 

20 sequences of the E. coli genome and 400,635 known sequences in the 
GenBank, EMBL, DDBJ, PDB DNA databases. The DNA sequences of the 
1583 bp fragment were also analyzed by a software program, Gene Works 
2.5; a potential open reading frame (ORF) was identified within the 
sequences of 1043 - 90 nt. 

25 Marker application 

The DNA fragment has been useful as a marker in 
developing a PCR assay for specific detection of E, coli serotypes 0157:H7, 
0157:NM and 055:H7. The DNA sequences of the primers used in the 
PCR assay are provided in Table 3. Primer 0157-F contains one mismatch 

30 to the sequences of 0157:H7/0157:NM for eliminating possible cross 
reaction from the sequences of 0145:NM. The primer pair results in a 560 
bp PCR product under the following conditions. An amplification 
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reaction mixture (20 jllL) contained 10mM Tris-HCl and 50 mM Kcl (pH 
8,3), 1.2 mM MgCl 2 , 200 jxM each of dATP, dCTP, dGTP and dTTP, 0.4 

pmol/fiL each of the primers, 0,5 unit of Taq DNA polymerase 
(Perkin-Elmer) and 5 ul of DNA template. The thermal cycling conditions 
5 (GeneAmp 9600 PCR System, Perkin-Elmer) were as follows: initial 
denaturation at 94°C for 3 min; 35 cycles consisting of denaturation at 94°C 
for 30 sec, annealing at 60°C for 30 sec and extension at 72°C for 45 sec; 
incubation at 72°C for 5 min. PCR products were visualized on a 1.6% 
agarose gel after staining with ethidium bromide. 

10 The specificity of the primers and the PCR assay has been 

evaluated with a total of 119 E. coli strains belonging to 60 serotypes (Table 
4) and 59 isolates belonging to 44 non-E. coli species (Table 5). The 560 bp 
specific PCR product appeared only with strains of serotypes 0157:H7, 
0157:NM and 055:H7 (Figure 3) but not the strains of the other serotypes 

15 of E. coli or other bacterial strains tested. The PCR assay can be used for the 
detection of E.coli 0157:H7 in food, clinical, water and environmental 
samples. 
Example 2 

Preparation of a Microchip 
20 MATERIALS AND METHODS 
DNA microchip system 



Lab /Stanford System that was assembled at Virtek Vision (Waterloo, 
Ontario) and can produce 1000 to 2500 spots/cm 2 . The DNA scanner, 
25 ChipReader™ (with data collection software) was from Virtek Vision. 
The software ImaGene was the product of BioDiscovery and contributed by 
Virtek. 

DNA probe preparation 



30 attached to the chip surface and "target'' in the fluorescent molecules that 
hybridize to this probe. 



The arrayer (with operation software) was a Brown 



In this Example, "probe" refers to the DNA covalently 
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The probes for E, colt serotype 0157:H7 were designed 
according to the sequences of the 0157:H7 marker and are illustrated in 
Table 6. Short (about 20 bp) probes were prepared by direct synthesis and 
5'-end labeling with amine. Other probes were prepared by PCR using the 
5 DNA samples from the control £. colt strains and specific primers with 
one of them labeled with 5'-amine. 

PCR reaction mixtures (50|iiL) contain lx amplification Buffer 
II (Perkin-Elmer), 200 fiM dNTPs, 2 mM mgCl 2 , 0.5 jliM forward and 
reverse primers, 1 unit of AmpliTag DNA polymerase (Perkin-Elmer) and 

10 lOfiL of template DNA. Amplification for probe preparation consisted of 
incubation at 95°C for 1 min. 32 cycles of 30s at 94°C, 30s at 52°C and 1 min 
at 72°C, and extension at 72°C for 7 min. PCR products were visualized 
using ethidium bromide-stained agarose gels, purified using QIAquick 
PCR purification Kit (Qiagen) and dissolved in 3x SSC buffer for spotting. 

15 Probe spotting and immobilization on the chips 

The purified DNA probes with amine-modified tags were 
spotted on a glass microscope slide that is coated with aldehyde groups 
(Telechem International, Inc.). The DNA probes were immobilized onto 
the glass surface by covalent binding via the Schiff's base attachment* 

20 Spotting was carried out with the Stanford arrayer system. This robot 
system is designed to automatically collect samples (DNA probes) from 
either 96- or 384-well microtitre plates, and to spot the samples onto up to 
16 slides simultaneously. The probes were spotted with a single pin at a 

density of 2500 spots/cm 2 (200 |nm spacing). After spotting, the slides were 
25 rehydrated in a humidity chamber for 2-4 h, and rinsed once with 0.2% 
SDS and twice with water. The slides were then incubated for 5 min in a 
sodium borohydride solution (1.0%). The probes (if they were PCR 
products) were denatured by heating the slides at 95°C for 2 min, rinsing 
once with 0.2% SDS and twice with water. The slides were dried and 
30 stored at room temperature until use. 
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Fluorescent target DNA preparation 

Fluorescent target DNAs were prepared by PCR from the 
control strains of bacteria with specific primers. The PCR reaction 
conditions were same as those used in the probe preparation except the 
5 following modifications: the primers were not labeled; 50% of the normal 
dCTP was substituted with 500 \iM of fluorescent Cy3-dCTP (Amersham 
Pharmacia Biotech) in the PCR reaction mixtures. The labeled target 
DNAs were purified with MicroSpin columns (Amersham Pharmacia 
Biotech) and stored at -20°C until use. 
10 DNA hybridization 

The immobilized probes were pre-hybridized with 1 jag of 

salmon sperm DNA in 20 |aL of the DIG Easy Hyb solution (Boehringer 
Mannheim) at 65°C (37°C for small fragments) for 1 h. The target DNAs 
(10 liL) were denatured at 95°C for 2 min, and mixed with 10 jiL of the DIG 

15 Easy Hyb solution and applied onto the processed slide with a coverslip. 
The slides were placed in a humid chamber and incubated overnight at 
37°C or 62°C. Following hybridization, the slides were washed with the 
washing buffers with different stringency for four times and dried at room 
temperature. 

20 Data collection and analysis 

Hybridization patterns were scanned using the ChipReader™ 
(Virtek). Data were collected using the ChipReader™ collection software 
(Virtek) at 100% laser intensity, 700-900 detector sensitivity, 2 detector gain 
on the Cy-3 channel. Fluorescent images were analyzed using the 

25 ImaGene™ software (BioDiscovery). 
Results 

The marker of the invention has been demonstrated to be 
useful in developing a DNA microarray/chip-based assay for specific 
detection of the E. coli serotypes 0157:H7, 0157:NM and 055:H7 in a 
30 sample. Table 6 shows the sequences of the two DNA probes that were 
designed based on the marker. The DNA probes suitable for spotting were 
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generated using PCR and tagged with amine at their 5' end. The probes 
were used together with other 22 probes and immobilized onto the 
microscopic slide surface by covalent binding via Schiff's base attachment. 
The microarray was used in hybridization with Cy3 labeled target DNA 
5 prepared from 10 different reference strains of E. coli. The inventors have 
shown that the two specific probes on the chip specifically hybridized to 
the strains of E. colt serotype 0157:H7. 

While the present invention has been described with 
reference to what are presently considered to be the preferred examples, it 

10 is to be understood that the invention is not limited to the disclosed 
examples. To the contrary, the invention is intended to cover various 
modifications and equivalent arrangements included within the spirit and 
scope of the appended claims. 

All publications, patents and patent applications are herein 

15 incorporated by reference in their entirety to the same extent as if each 
individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference in its entirety. 
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Table 1 

DNA sequences of the selective primers that produced the AFLP marker 
Primer Oligonucleotide sequence 

EcoRI-C 5'-GACTGCGTACCAATTCC-3' (SEQ.ID.NO.:7) 

Msel-G 5-GATGAGTCCTGAGTAAG-3' (SEQ.ID.NO.:8) 
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Table 2 

E. coli strains screened for the presence 
of the 0157:H7 marker in AFLP analysis 



Serotype 


No. of strains 


Serotype 


No. of strains Serotype No. 
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Table 3 

DNA sequences of the primers specific for 
R coli 0157:H7, 0157:NM and 055:H7 



Primer Oligo-nucleotide sequence Location within the 

marker sequence 



0157-F (22-mer) 5 , ~CGGTTTAATGGCTTGTTGTGCT-3 , 597 - 618 (SEQ.ID.NCX:3) 

0157-R (19-mer) 5-ATGCCATTAAACCGGTGGC-3' 1136 - 1155 (SEQ.ID.NO.:4) 
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Table 4 

E. coli strains tested for the presence of the 
0157:H7 marker in the PCR assay 



Serotype 


No. of strains 


Derotype 


i\io. or strains 


Serotype No, of strains 
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TableS 

Non-E, coli species tested for the presence 
of the 0157:H7 marker in the FCR assay 



Species No. of isolates 



Acinetobacter calcoaceticus 1 

Aeromonus hydrophila 2 

Bacillus cereus 2 

Bacillus circuluris 1 

Bacillus parabrevis 1 

Bacillus subtilis 1 

Citrobacter diversus 1 . 

citrobacter freudii 1 

Campylobacter 2 

Campylobacter coli 2 

Campylobacter jejuni 1 

Canadida albicus 1 

Enterobacter cloacae 1 

Enterobacter amnigenus 1 

Enterococcus fecalis 2 

Edwardsieila tarda 1 

Kluyvera ascovbata 1 

Listeria monocytogenes 1 

Listeria ivarovii 1 

Lactobacillus brevis 1 

Lactobacillus planterum 1 

Moraxella osloensis 1 

Proteus vulgaris 1 

Proteus mirabilis 1 

Pseudomonus aeruginosa 1 

Salomonella tryphimurium 4 

Salmonella heidelberg 1 

Salmonella thompson 1 

Salmonella newport 1 

Salmolella hadar 2 

Salmonella infantis 2 

Salmonella schwarzengrund 2 

Salmonella choleraesuis var.kunzendorf 2 

Shigella flexaeri 1 

Shigella sonnei 1 

Staphylococcus aureus 4 

Straphylococcus epidermidis 1 

Staphylococcus haemolyticus 1 

Straphylococcus lugdenersis 1 

Straphylococcus intermedium 1 

Streptococcus pyogenes 1 

Streptococcus bovis 1 

Xanthomonas malitophilia 1 

Yersinia enterocolitica 1 
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Table6 

DNA sequences of the probes specific for 
R coli 0157:H7, 0157:NM and 055:H7 



Probe 



Nucleotide sequence 



Location within the 
marker sequence 



0157-A 



CGGTITAATGGCTTCTTGTG 
GTAACACCGAAGCCAGCTCA 
ATAAATTGCTGCGATGAGTT 
ACAGCTATCGAGTAAACCACC 



597-677 



0157-B 



TATCCCGAATCGCCTGGCGT 
TTTTGCACATC CTCTGCGAC 
GCTATTTTTGTGGAACGCAA 
AGCCATCAAGGAAAATAGCC 
ACCGGTTTAATGGCAT 



1059 - 1155 



